An Inverse Method for Policy-Iteration Based Algorithms
Similar resources
An Inverse Method for Policy-Iteration Based Algorithms
We present an extension of two policy-iteration based algorithms on weighted graphs (viz., Markov Decision Problems and Max-Plus Algebras). This extension allows us to solve the following inverse problem: considering the weights of the graph to be unknown constants or parameters, we suppose that a reference instantiation of those weights is given, and we aim at computing a constraint on the par...
Analysis of Classification-based Policy Iteration Algorithms
We introduce a variant of the classification-based approach to policy iteration which uses a cost-sensitive loss function weighting each classification mistake by its actual regret, i.e., the difference between the action-value of the greedy action and of the action chosen by the classifier. For this algorithm, we provide a full finite-sample analysis. Our results state a performance bound in t...
Analysis of Classification-based Policy Iteration Algorithms
We introduce a variant of the classification-based approach to policy iteration which uses a cost-sensitive loss function weighting each classification mistake by its actual regret, that is, the difference between the action-value of the greedy action and of the action chosen by the classifier. For this algorithm, we provide a full finite-sample analysis. Our results state a performance bound i...
Algorithms and Bounds for Sampling-based Approximate Policy Iteration
Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervised learning problem, have been proposed recently. Finding good policies with such methods requires not only an appropriate classifier, but also reliable examples for the best actions, covering all of the state space. One major ques...
An inverse iteration method for eigenvalue problems with eigenvector nonlinearities
Consider a symmetric matrix $A(v) \in \mathbb{R}^{n \times n}$ depending on a vector $v \in \mathbb{R}^n$ and satisfying the property $A(\alpha v) = A(v)$ for any $\alpha \in \mathbb{R} \setminus \{0\}$. We will here study the problem of finding $(\lambda, v) \in \mathbb{R} \times \mathbb{R}^n \setminus \{0\}$ such that $(\lambda, v)$ is an eigenpair of the matrix $A(v)$ and we propose a generalization of inverse iteration for eigenvalue problems with this type of eigenvector nonlinearity. The convergence of the ...
Journal
Journal title: Electronic Proceedings in Theoretical Computer Science
Year: 2009
ISSN: 2075-2180
DOI: 10.4204/eptcs.10.4